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^ ; ABSTRACT 

We present a series of colour evolution models for Luminous Red Galaxies (LRGs) in 
the 7th spectroscopic data release of the Sloan Digital Sky Survey (SDSS), computed 
using the full-spectrum fitting code VESPA on high signal-to-noise stacked spectra. 
The colour-evolution models are computed as a function of colour, luminosity and 
redshift, and we do not a-priori assume that LRGs constitute a uniform population 
of galaxies in terms of stellar evolution. By computing star-formation histories from 
the fossil record, the measured stellar evolution of the galaxies is decoupled from 
the survey's selection function, which also evolves with redshift. We present these 
evolutionary models computed using three different sets of Stellar Population Synthesis 
(SPS) codes. We show that the traditional fiducial model of purely passive stellar 
evolution of LRGs is broadly correct, but it is not sufficient to explain the full spectral 
signature. We also find that higher-order corrections to this model are dependent 
on the SPS used, particularly when calculating the amount of recent star formation. 
The amount of young stars can be non-negligible in some cases, and has important 
implications for the interpretation of the number density of LRGs within the selection 
box as a function of redshift. Dust extinction, however, is more robust to the SPS 
modelling: extinction increases with decreasing luminosity, increasing redshift, and 
increasing r — i colour. We are making the colour evolution tracks publicly available 
at |http : //www. icg.port . ac .uk/~tojeiror/lrg_evolution/[ 
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• • . 1 INTRODUCTION 



In the currently favoured model of galaxy formation, struc- 
ture builds up hierarchically and galaxies build up their stel- 
lar mass within host dark matter haloes via a complex mix- 
ing of two processes - star formation from cold gas, and 
merging. This model is supported by observed large-scale 
structure and its evolution, which can be well understood 
and remarkably well modelled within the A Cold Dark- 
Matter scenario (LCDM) . Because of the complicated astro- 
physics involved, the link between mass build-up and star- 
formation is not direct. There are two main observables with 
which to understand this link: stellar light, which holds in- 
formation about the formation history of the stars present in 
any given galaxy at the time of observation, and direct ob- 
servation of size, density and clustering of individual galaxies 
and their evolution with redshift. The combination of these 
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two observables has the potential to resolve a long-standing 
quest of galaxy evolution - to match samples of galaxies at 
different redshifts, without making assumptions about their 
stellar or dynamical evolution. 

Luminous Red Galaxies (LRGs) , for which typically the 
label of Early- Type Galaxy (ETG) also applies, occupy an 
interesting position within the galaxy evolution puzzle. They 
occupy and dominate the high-mass end of the galaxy mass 
function and are predicted to occupy the most massive dark 
matter halos. According to LCDM, these halos must have 
started their assembly earlier, and such a signature is indeed 
seen in the galaxy stellar populations - LRGs are clearly 
dominated by old stars, with 70-90 per cent of the present 
stellar mass having been formed at z > 1, with the de- 
tailed measurement de pendent on th e stu dy and exac t sam- 
ple dWake et al.ll2006l; ferown et al.ll2007l : ICool et al.ll2008l ). 
Ivan Dokkum et "ail (2010) have extended this to higher red- 
shift, and measured growth of massive galaxies - albeit not 
exclusively red galaxies - as factor of 2 since z — 2. They 
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concluded this growth was mostly due to mergers, rather 
than star formation. There is also evidence that more mas- 
sive LRGs have, on average, older stellar populations, giv- 
ing strength to the hierarchical forma tion scenario (e.g. 
iThomas etai1l2005l ; Ijimenez et al.ll2007T ). What is not clear, 
however, is why star-formation seems to have shut down 
in t hese massive gala xies since roughly a redshift of two 
(see iPeng et al I l2010l for a recent proposal, and references 
within). The low-level of recent star formation suggests that 
these galaxies have either been dynamically passively evolv- 
ing, or that any mass build up through merging in the recent 
Universe must have occurred through either dry merging (re- 
sulting in no new star- formation), or minor mergers (perhaps 
inducing only very small levels of star formation). It is also 
possible that the recent star formation observed in LRGs is 
due completely or in part to the gas returned by stars in 
the giant branch (AGB) to the interstellar medium. Given 
the low levels of star formation activity in these galaxies, 
such a source of gas would provide a natural explanation for 
the low level of star formation activity observed in LRGs at 
z < 1. 

Measuring the mass build-up of massive galaxies is 
therefore a crucial and stringent test of models of galaxy 
formation. In spite of being a puzzle in itself, the process 
that shut down star formation in massive early-type galax- 
ies has made this particular population extremely attractive 
for observational cosmologists. The fact that they are lumi- 
nous and are thought to have a simple stellar evolution make 
them easy to target and observe at a range of redshifts. The 
ability to predict the colour evolution as a function of red- 
shift ensures an efficient allocation of fibres or slits for spec- 
troscopic measurements, and the fact they are very massive 
makes them observable to larger distances. Finally, LRGs 
are often approximated as being a uniform population of 
galaxies that experiences no significant amount of merging. 
This is attractive as the computation of their predicted den- 
sity and velocity bias evolution then becomes trivial |Frvl 
1996), enabling more information to be recovered using the 
clustering of this population to form a standard ruler to 
measure cosmological geometry or using them to measure 
Redshift- Space Distortions. 

There is a wealth of literature on LRGs and ETGs, their 
properties and evolution. Studies have been performed based 
on (s ee also references within): the mass or luminosity func- 
tion dWake et al.ll2006l ; iBrown et ai1l2007l ; iFaber et al 1l2007l; 
Cool et all 120081); col our -ma gnitude diagram (|Cool et al.l 



20061; iBernardi et all 



2010 ); photometry SED fitting 
( Kavirai et al.1 120091 ; IMaraston et~ai1 120091 ); absorption 



line fi tting to indiv i dual galaxie s ' spectra dTrager et al.l 
|2000| ; IThomas et al] 120051. |2010|; ICarson fc Nicholl I2OI0F) 

1 di 



or to stacked spectra (|Eisenstein et al.l 120031; iGraves etal 



200E ; |Zhu et al.l l201dh : f ull spectral fitting l|jimenez et al 
2007) ; close-pair counts |Bundv et al. 120091') and clusterin , 
ilSheth et alT|2006l ; IMasiedi et al.l l2006l; IConrov et alll200 
White et all |2007|; iMasiedi et al.1 |200S|; IWake et al.1 1200 
Toieiro fc Percivalll2010l ; |Pe Propris et alj|20ich ~ 



These papers can be broadly split into two types: those 
that aim to gain knowledge of the stellar content and evo- 
lution of these galaxies, and those that are interested in 
their dynamical evolution, or merging history. The picture 
seems set in its broad terms, but there is disagreement in 
the details. It is well established, from the work cited above 



and many more, that ETGs constitute a very uniform pop- 
ulation of galaxies; are dominated by old and metal rich 
stellar populations; their mean ages (either mass- or light- 
weighted) decrease with luminosity; and the most luminous 
occupy more dense environments. There is, however, an in- 
creasing amount of evidence pointing towards some amount 
of recent star formation in intermedia te-mass ETGs com- 
ing, e.g. from UV e xcess measurements (|Kavirai et al. I l2007l ; 
ISalim fc Richl koiO). This amount of star formation is not in 
confli ct with the h i erarch ical model of structure formation, 
and iKavirai et all (|2010h . through evidence coming from 
small morphological disruptions in early-type galaxies, ar- 
gue that it can be explained from the contributions from 
minor-mergers. The overall dynamical evolution is best con- 
strained via a clustering analysis, although this tradition- 
ally involves assuming a model for the stellar evolution of 
th e galaxies in the sample. Measurements vary (see Table 4 
in lToieiro fc Percival|[2O10l for a summary), but luminosity 
growth seems to be confined to less that 20 per cent since a 
redshift of 1. 



1.1 This work 

The motivation for t his work is two fold. In 
iToieiro fc Percivall (|2010h we constrained the dynami- 
cal evolution of Sloan Digital Sky Survey (SDSS) LRGs by 
assuming the stellar evolution of these galaxies was known, 
and followed the passive evolution model (with a small 
metal-poor component) of IMaraston et al.l ( 20091 ) (M09). 
Given the evidence that ETGs have different histories 
according to their luminosity, as well as evidence for recent 
star formation in at least intermediate-mass ETGs, the 
assumption of passive evolution after a single star burst for 
all LRGs is clearly an oversimplification. The first of our 
goals is therefore to provide the community with empirically 
derived colour evolution models for the SDSS LRG sample 
that depend on luminosity, colour and redshift. 

Secondly, our approach of using the fossil record to in- 
fer their star-formation and metallicity histories, as well as 
dust content, and from there compute their colour evolution 
as a function of redshift makes minimal prior assumptions 
about the evolution of the galaxies. The method adopted 
is to compute the star-formation and metallicity histories 
using VESPA |Toieiro et alj|2007t ) - a full-spectrum fitting 
code that assumes no prior information about the star- 
formation or metallicity history of a galaxy other than it 
is non-negative. Non-parametric full-spectral fitting has ad- 
vantages and disadvantages when compared to other meth- 
ods. The main advantage comes from a well time-resolved 
and completely unconstrained star formation history, which 
does not suffer from any problems and biases tradition- 
ally associated with SSP- equivalent ages and analyses (e.g. 
iTraeer fc Somerville|[2009l ) . On the other hand, full-spectral 
fitting can be more sensitive to inaccuracies in the mod- 
elling (particularly dust), and spectrophotometric calibra- 
tions. Parameter degeneracies are not a problem, in the sense 
that they can be estimated and incorporated into any anal- 
ysis. 

There are also clear advantages in using the fossil record 
to compute the colour evolution of a sample of galaxies. 
Primarily, one does away with the circular methodology of 
pre-selecting a sample across a range of redshifts that one 
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believes is itself a uniform population of galaxies. This is 
traditionally done according to an evolutionary model and 
after applying the necessary K+e corrections and survey se- 
lection functions, and culling the observed sample to one 
that is self-consistent (i.e., any galaxy in this culled sam- 
ple could have been observed at any redshift probed by the 
survey, according to a model). The obvious danger of such 
an approach is that the deductions about the evolution of 
the sample simply reflect the assumptions used to define 
it. In our approach, however, evolutionary colour paths that 
stray from the colour selection box are acceptable. We there- 
fore de-couple the sample of galaxies observed at any given 
redshift from galaxies observed at earlier epochs, and we 
are insensitive to changing survey selections. Of course, if 
one wants to interpret this evolution in terms of the overall 
evolution of a population of galaxies then sample selection 
becomes crucially important. We leave that for a series of 
follow-up papers. In this work we introduce a new set of em- 
pirical models for the evolution of galaxies labelled as Lu- 
minous Red Galaxies, as a function of their observed colour, 
luminosity and redshift. 

This paper is organised as follows. We start by describ- 
ing the dataset used in this work in Section [2] in Section [3] 
we describe in detail the several steps of our methodology; 
in Section 0] we describe the three sets of SPS models used 
throughout the paper; we follow by presenting our results 
in Section [5] which we interpret in Section [6] and we finally 
summarise and conclude in Section [7] 
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Figure 1. The redshift - absolute magnitude diagram of the DR7 
LGRs used in this paper (random subsample of 20,000 objects). 
The magnitudes are rest-frame r— band absolute magnitudes, and 
are k-corrected only. The red lines show the limits that split our 
sample into 4 magnitudes bins at each redshift (see Section [2] for 
more d etails), k-corrections wer e calculated using the k-correct 
code of lBlanton &: Roweisl (2007). 
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2 DATA 

The SDSS is a photometric and spectroscopic survey, car- 
ried from a dedicated 2.5m telescope in Apache Point, New 
Mexico. Photometry was taken in five bands: u,g,r,i and 
z, corresponding to central effective wavelengths of 3590A, 
4819A, 6230A, 7640A, and 9060A respectively. For details 
on the har dware, software and data -reduction see lYork et al.l 
(2000) and lStoughton et al l (|2002T ). In s ummary, the surve y 
is carried out on a mosaic CCD camera (|Gunn et al.l fl998). 
two 3-arcsec fibre-fed spectrographs, and an auxiliary 0.5m 
telescope for photometric calibration. 

Objects were selected for spectroscopic follow-up ac- 
cording to two main targetin g algorithms. The main galaxy 
sample l|Strauss et al.l I2002T ) is a magnitude-limited, high- 
completeness (>99%) sample, selected in the r— band. The 
targeting is done down to r = 17.77, resulting in a median 
redshift of around z — 0.11. The luminous red galaxies sam- 
ple extends the redshift range to 0.15 < z < 0.5 by targeting 
luminous and very red objects acco r ding t o the target algo- 
rithm described in Eise nstein et aU |200ll ). 

In this pa per we analyse the late st SDSS LRG sample 
(data release 7. lAbazaiian et afl|2009l ). with a spectroscopic 
footprint of around 8000 sq. degrees and which includes 
around 180,000 objects. Fig. [T] shows the r— band absolute 
magnitudes as a function of redshift for a random subsam- 
ple of our galaxies. Th e k-corrections wer e calcu lated us- 
ing the k-correct code of lBlanton fc Roweisl (|2007l 'l. Strictly, 
to be self-consistent and completely model independent, we 
should derive k-corrections from the observed spectra. How- 
ever, the difference between this approach and using the 



-21 - 



-22 - . 



-23 



-24 
10.5 




11.0 11.5 12.0 12.5 
og M,/M solar 



13.0 



13.5 



Figure 2. Absolute magnitude vs. stellar mass for the LRG sam- 
ple. The stellar m asses are from the public VESPA database 
llToieiro et alj|2009h . The error bar at the top-right corner shows 
a typical error on the stellar mass. 



model fits of the k-correct package will be small, and this 
will not significantly affect our analysis. 

The goal of our work is to provide independent evolu- 
tionary histories for samples of galaxies. Luminosity evolu- 
tion corrections are therefore a result from our work, and 
cannot be assumed a-priori. We therefore choose samples 
based only on k-corrected luminosities, with limits designed 
to optimise our sampling at each redshift, such that the 
number of galaxies in each luminosity range is kept approx- 
imately constant. We show these boundaries in Fig[T] 



© 0000 RAS, MNRAS 000, 000-000 



4 Tojeiro et at 



In Fig. [2] we show the stellar masses of the LRGs in our 
sample, and how they relate to the absolute magnitudes. The 
scatter in this relationship is dominated by uncertainties in 
the stellar mass estimation, but they are not sufficient to ex- 
plain all of the observed scatter. The remaining stochasticity 
indicates intrinsic differences in star formation histories and 
mass-to-light ratios between different LRGs. 



3 METHOD 

Our methodology can be summarised in the following series 
of steps: 

(i) we stack the observed spectra in regions of colour- 
luminosity-redshift space, in order to obtain a consistent 
signal-to-noise ratio (SNR) across this plane; 

(ii) we analyse the stacked spectra with VESPA, in order 
to obtain a star- formation history (SFH), a metallicity his- 
tory and dust content representative of the galaxies in any 
given cell of colour-luminosity-redshift space; 

(iii) we estimate systematic errors due the limitation of 
the models, so that we can later study their impact on the 
recovered parameters; 

(iv) we smooth the recovered SFH in lookback time, to 
obtain a continuous function with time; and 

(v) we evolve the observed spectrum with lookback time, 
to obtain a rest-frame spectrum as a function of time from 
which we can compute any suite of observed-frame colours. 

In the rest of this section we explain in detail each of 
these steps. 

3.1 Stacking the data 

We stack galaxies of similar colour, luminosity and at the 
same redshift. The resulting spectrum gives a weighted evo- 
lution of all the galaxies within the stack. 

We work on a fixed grid in redshift, from 0.15 to 0.5 
in steps of 0.05. At each redshift, we split the sample into 
four luminosity slices, such that each slice has approximately 
the same number of galaxies. Within each redshift bin and 
luminosity slice, we split the galaxies into stacks based on 
r — i colour, with the number of stacked objects chosen such 
that each stack contains a fixed number of objects, set to 
be around 100(0. This number was chosen so that the SNR 
of the stacked spectra would permit differentiation between 
ages of interest in this paper (see Section l3~3l for details). The 
typical photometric errors increase steeply into the blue, so 
the spread in g — r colour of LRGs would be expected to be 
larger than that in r — i even if LRGs did not have any intrin- 
sic scatter in their colours. There is also a linear relationship 
between redshift and g — r up to z < 0.35, which we cap- 
ture by binning in redshift. We therefore stack inr-i colour 
rather than g—r. The result is a number of non-uniform grids 
(one per luminosity slice) in r — i colour, depending on the 
density of objects in the colour-redshift plane. We choose 

1 For the sake of clarity, we will continue to refer to each region 
bound by colour, luminosity redshift as a cell, to each column of 
cells for the same redshift as a redshift bin, and to each grid of 
colour-redshift for a given luminosity range as a luminosity slice. 



to do this in favour of a uniform grid in order to keep a 
roughly constant SNR of the resulting stacked spectra. The 
disadvantage is that sparse regions in these diagrams need 
to be larger in order to reach the target number of galaxies, 
and variations in the intrinsic nature of the galaxies within 
each cell may be larger. This does not invalidate the re- 
sulting evolution model for this cell, which will simply be 
a roughly luminosity-weighted average evolution for all the 
galaxies within the cell, although it may mask larger depar- 
tures from the mean within the cell. The above procedure 
occasionally results in cells that are smaller, in r — i, than 
the typical error on r — i for individual galaxies. This means 
that there is likely scatter of galaxies across cells, and that 
the VESPA solutions from adjacent cells should only change 
smoothly (or as smoothly as the observational uncertainty 
in whatever quantity we bin). In other words the true res- 
olution in r — i is given by the photometric errors, and not 
by the size of the cells. 

The goal of this paper is to compute a set of empirical 
evolutionary models for the colour of LRGs, and once again 
we assume that within a cell this is independent of lumi- 
nosity. This assumption is strengthened by us taking more 
than one luminosity range, but there may be residual de- 
pendencies within a given chosen range. The purpose of the 
stacking procedure, therefore, is to compute an optimal es- 
timate of the underlying spectrum, of which we assume each 
galaxy represents a random realisation. We discard luminos- 
ity information within each cell by normalising all spectra to 
a common wavelength, and we compute an inverse-variance 
weighted average, per wavelength point: 

F S (/ A ) = £^Z|M , (i) 

where F is the stacked spectrum, f\ ti is the flux point 
at wavelength A of galaxy i and the corresponding error. 
The sum is done over galaxies in any given cell. 

The resulting error in each wavelength point of the 
stacked spectrum is simply 

Note that by weighting the spectra within each stack 
based on signal-to-noise we are not modelling the total lu- 
minosity of the galaxies within the bin when we fit to the 
stacked spectrum. Instead our results should be interpreted 
as giving the history of a weighted function of the galaxies 
within that bin, although this weighting is actually close to 
a luminosity weighting, as spectral S/N is closely related to 
luminosity. 

The typical number of galaxies per stack is 1000, and 
we have a total of 124 stacks. 

3.2 VESPA and the fossil record 

We use VESPA to interpret the stacked spectrum of each cell 
in terms of a star formation an d metallicity h i story. The al- 
gorithm is described in detail in iToieiro et all l|2007fl . and it 
has been applied to SDSS's LRG and Main G alaxy samples 
result ing in a queryable database, described in lToieiro et al.l 
(2009). We refer the reader to these two papers for details 
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but, for completeness, we present now a very brief summary 
of the method. 

In short, VESPA inverts the equation F\ = F\, where 
Fx is the observed rest-frame flux, and the model flux is 



F A (to) 



fdust(rx,t)ip(t)Sx(t, Z)dt, 



(3) 



to solve for the star formation rate ip(t) (solar masses formed 
per unit of time) and the age t and metallicity Z of the com- 
ponents of the stellar populations that each give luminosity 
per unit wavelength S\{t,Z), per unit mass. The depen- 
dency of the metallicity on age is unconstrained, turning 
this into a non-linear problem. To get around this, we inter- 
polate between the tabulated values of Z given by the SPS 
models (see Section |4| giving a piecewise linear behaviour: 

S'x(t, Z) = g(t)Sx(t, Z a ) + [1 - g(t)] Sx(t, Z b ), (4) 

where S(t,Z a ) and S(t,Zi,) are equivalent to S\(t,Z) in 
Eq. 0, but at fixed metallicities Z a and Z\>, which bound 
the true Z. 

Solving the problem then requires finding the correct 
metallicity range. One should not underestimate the com- 
plexity this implies - trying all possible combination of con- 
secutive values of Z a and Z b in a grid of 16 age bins would 
lead to a total number of calculations of the order of 10 9 , 
which is unfeasible even in today's fast personal worksta- 
tions. We work around this problem using an ite rative ap- 
proach, which we describe in lToieiro et al.l (|2007h . 

As for the treatmen t of dust, we follow the two- 
parameter dust model of ICharlot fc Fall ((2000) in which 
young stars are embebbed in their birth cloud up to a time 
tsc, when they break free into the inter-stellar medium 
(ISM): 



fdust ( T X,t) — 



f (-ISM 
Jdust\T\ 

Jdust(T x 



)fdust(Tx C 
),t> t B C 



, t < tBC 



(5) 



where t{ sm is the optical depth of the ISM and rf c is the 
optical depth of the birth cloud. In previous runs of VESPA, 
we took tBC = 0.03 Gyrs. However, in this run we have re- 
duced resolution at the young end (see Section 13.31 for a 
discussion on the revised age grid) and we do not resolve 
star-formation under 74 Myr. We therefore take this bound- 
ary as tBC, and note that removing this dust component 
altogether has minimal effect in our results and conclusions. 

There is a variety of choices for the form of fdusti^x}- 
T o model the dus t in th e ISM, we use the mixed slab model 
of lCharlot fc Falll l|2000l ) for low optical depths (r v < 1), for 
which 

fdustirx) = J-[l + (tx - 1) exp(-T A ) - tIe^tx)] (6) 

2t A 

where E\ is the exponential integral and tx is the optical 
depth of the slab. This model is known to be less accurate 
for high dust values, and for optical depths greater than one 
we take a uniform screening model with 



fdustirx) = exp(— tx). 



(7) 



We only use the uniform screening model to model the dust 
in the birth cloud and we use tx = ry (A/5500A) -0 ' 7 as our 
extinction curve for both environments. 



Recovering the star formation history, and its dust con- 
tent, is solved by making assumptions about the form of 
fdust and Sx it, Z) and by minimizing 



x 



ExiFx-Fx) 2 



(8) 



Even though the problem has an analytical solution, a 
dataset perturbed by noise or that is otherwise deteri- 
orated leads to instabilities in the matrix inversion and 
the recovered solut ions can be entirely dominated by noise 
l|Ocvirk et al.ll2006h . VESPA has a self-regularization mech- 
anism which estimates how many independent parameters 
one should recover from a given a dataset that has been 
perturbed. The result is a parametrization which varies from 
galaxy to galaxy, depending on its SNR and wavelength cov- 
erage. 

It is the loss of resolution in lookback time, for low SNR 
spectra, that prompts the need for stacking in data-space. In 
other words, given the self-regularization mechanism and the 
inherent loss of - noisy! - information in cases of poor-quality 
data, it is not equivalent to stack in data or solution space. 
So, in general, the average of the star formation histories of 
the individual galaxies is not the same as the star formation 
history of the average spectrum. This becomes less and less 
true for increasingly better data of the individual objects, 
but in the case of the LRGs the data quality is sufficiently 
poor - especially at higher redshift - that stacking becomes 
necessary. 

The physical quantities output by VESPA for each 
stacked spectrum are: 

• star formation fraction in time interval Atj, Xj\ 

• mass-weighted metallicity for mass formed in time in- 
terval Atj, Zj\ 

dust attenuation for stars younger than 0.074 Gyrs, 



tbc; 

• dust attenuation for all stars, tism- 

Given our choice for stacking method and the lack of 
overall normalisation, we cannot convert the mass fractions, 
Xj, into an absolute mass recovered in each time interval. 
This, however, does not pose a problem as we aim to com- 
pute a set of colour evolution tracks, for which an overall 
normalisation is not needed. For any given galaxy within a 
cell, the overall normalisation is given by its redshift and 
apparent magnitudes. 



3.3 The age grid of VESPA 

In its original configuration, the VESPA age grid runs be- 
tween 0.002 and 14 Gyr, in logarithmically spaced time bins. 
This, however, means that the width of the final bin is 
around 5 Gyr. This in turn is comparable to the stretch 
of lookback time that the LRG sample probes (roughly be- 
tween z=0.5 and z=0.1) and we therefore have very little 
sensitivity to the exact formation epoch of LRGs. The origi- 
nal configuration was so designed because typical SDSS data 
for individual objects does not normally allow us to distin- 
guish between populations that are separated in age by less 
than the bin widths. 

The large SNR obtained from stacking, however, opens 
up the possibility to use a revised grid that has more sen- 
sitivity to older populations, at the expense of losing time 
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resolution at the young end (the number of bins and overall 
structure must remain the same, for reasons constrained by 
the algorithm). We therefore construct an alternative age 
grid that we use to analyse the LRGs with VESPA. The 
boundaries of this new grid are: 0.002, 0.074, 0.177, 0.275, 
0.425, 0.658, 1.02, 1.57, 2.44, 3.78, 5.84, 7.44, 8.24, 9.04, 
10.28, 11.52 and 13.8 Gyr. Whereas these seem ambitious, 
statistically the data should be able to differentiate between 
models in adjacent bins: the differences between models of 
adjacent ages (normalised to a common value at a given 
wavelength) are several times that of the statistical noise in 
the stacked data. In practice, however, systematic errors can 
dominate - see the next Section for details. 

Finally note that given the high SNR of the stacked 
spectra, VESPA always runs to its highest time-resolution. 

3.4 Error analysis 

VESPA was designed to deal with limitations coming from 
photon-noise. However, we are purposely putting ourselves 
in the limit where the photon-noise is negligible, and the 
quality of the fits is dominated by limitations in the mod- 
elling. We do not have a set of models that give statistically 
good fits in the limit of high-SNR and this has two potential 
effects: i) an error budget dominated by systematics in the 
modelling inaccuracies; and ii) the recovered parameters are 
biased. 

To deal with the model inaccuracies, we make the as- 
sumption that if the models (SSPs, dust and parametriza- 
tion) could represent galaxies perfectly, then the recovered 
solution would match the stacked spectrum exactly. Under 
this assumption, the residuals from a fit in each cell can be 
interpreted as an overall error including statistical photon- 
noise errors and systematic model errors. 

(i) we run VESPA on the stacked spectrum and errors 
given by equations ©, and calculate the residuals r\ — 
\F X -F X \; 

(ii) we smooth r\ with a box-car filter to keep the large- 
scale variation but attenuate pixel-to-pixel fluctuations; 

(iii) we take the smoothed r\ as the new error, a t ) f tal and 
re-fit the stacked spectrum using this new error. 

Fig. [3] shows a typical example of the residuals and the 
new estimated error. Therefore we allow extra freedom in 
the regions where the models cannot fit the data, by assum- 
ing that the residual between data and best-fit model gives 
an indication of the 1-sigma confidence region for system- 
atic problems with the models, combined with the statisti- 
cal error. As covariances will vary between different models, 
we assume this systematic is uncorrelated for flux values 
at different wavelengths, to avoid placing further biases on 
acceptable solutions. This estimate of the combined error 
replaces the instrumental statistical error in all of our fits. 
Our estimated systematic error will almost certainly be too 
small, because the best-fit parameters were chosen to match 
the data, and minimise this difference. Fundamentally, this 
is a problem that cannot be solved without better models. 
However the methodology outlined in this section should at 
least provide a approximation for the error and will be better 
than not allowing for this problem. 

We translate this error into an error bar on the recov- 
ered physical parameters by constructing N random reali- 
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Figure 3. Estimating a new error vector. Red: residuals 
smoothed with a box-car filter; black: photon- noise errors from 
the stack given by Eq. The new estimated error larger than 
the stack error by around a factor of 10. 

sations of the best-fit spectrum, using the error calculated 
as per this section, and analysing these random realisations 
using VESPA. From this set of N solutions we can estimate 
the variance and co-variance of all the parameters of in ter- 
est. More details are given in lToieiro et al.l (|2007l , 120091 '). 

While these residuals provide a relative comparison of 
our current ability to model galaxies by comparing solutions 
obtained with different sets of models (SSPs, dust, etc), we 
must be careful not to interpret any variation in the recov- 
ered parameters as an absolute error bar - different models 
can be wrong in the same way. We analyse these differences 
in Section [5] 

Finally, we should also note that the difference between 
models in adjacent bins (see Section ^. 3[) can be of the order 
of - or smaller than - this estimate of the systematic error. 
Ignoring the wavelength dependence, the average systematic 
error would correspond to roughly a 1 Gyr error on an 8 Gyr 
population. 

3.5 Smoothing the fossil record 

VESPA assumes that the star-formation rate (SFR) is con- 
stant within a time bin, and this gives results with discon- 
tinuities in SFR across bin boundaries. This is an obvious 
over-simplification of the problem, as any star formation his- 
tory is likely to be continuous over an arbitrary set of bound- 
aries, if sampled at high enough resolution. For the purposes 
of this paper, this over-simplification is problematic because 
a discontinuous SFR will result in a physically discontinuous 
colour evolution. 

To solve this problem, once VESPA has produce model 
parameters, we replace the top-hat bins of constant SFR 
with Gaussian distributions. The widths and heights are cho- 
sen to be such that the mass formed within each pair of the 
old sharp boundaries does not change by more than 0.3%. 
The mass integrated over all ages is kept constant. These 
new Gaussian representations for each population (previ- 
ously given by a single top-hat bin) are sampled at a much 
higher rate in lookback time, such that we can calculate a 
smooth colour evolution. Each new Gaussian bin has the 
same metallicity as the old corresponding histogram. 

We do not expect this change in the SFR as a func- 
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tion of time to make a large difference in the interpretation 
of the spectra. This is because, by construction, the original 
boundaries are such that the spectral evolution between each 
boundary is not too significant. The offset between the spec- 
tra calculated top-hat and Gaussian bins is either smaller or 
of the order of the statistical error shown in Fig. O The flux 
obtained with the Gaussian bins is only used to compute the 
observed colours and their evolution with redshift, and the 
effect of this difference in the computed colours is negligible. 



3.6 Evolving the observed spectrum 

Our goal is to compute the rest-frame spectrum of each cell 
(representing an ensemble of galaxies) at any point t e > t g , 
where t g is the lookback time of the cell in question. We 
compute 

Fx(t e )= m k fx(t k -t e ,Z k ), (9) 
t k <t e 

where f(t, Z) is the flux of a population of age t and metal- 
licity Z, and mu = g(tk)Atk is the mass formed at as- 
suming a SFR given by g(t). For the case where t e = t g then 
we simply recover the observed spectrum, as shown in the 
section above. 

From Fx(t e ) we can easily compute any observed frame 
colours, by applying the relevant K+e corrections (fully 
known). As noted before, we cannot compute apparent mag- 
nitudes due to a lack of normalisation. However, for any 
individual galaxy the normalisation is set by the observed 
apparent magnitude and redshift, and F\(t e ) gives then not 
only the colour, but also the luminosity evolution of that 
galaxy, assuming that the SFR in the galaxies follows the 
weighted average within the cell. 



3.7 Modelling dust 

Dust evolution does not leave a footprint in a galaxy's spec- 
trum today, so it does not affect our modelling. However, in 
order to compare the colour tracks presented in Section f5. 3. 2 1 
to observed populations of galaxies at different redshifts, we 
do need to treat the evoluti on of dust. Not e that for the 
two-parameter dust model of lCharlot fc Fall (|200Cj ). in ad- 
dition to the inter-stellar dust component, young stars are 
allowed extra dust extinction to account for the effects of 
their birth cloud, which takes some time to dissipate, of the 
order tsc- Both of these components are expected to evolve 
with redshift. We can envisage three possible solutions to 
this problem: 

(i) we can assume that the values we recover for tbc and 
tism remain valid for the entire history of the galaxy. I.e., 
when t e is such that a population becomes younger than 
tBC then we apply the value of tbc obtained at t g , and we 
always apply the same value of tism to all population; and 

(ii) we can estimate the evolution of tbc and tism from 
values observed for particular galaxies as a function of red- 
shift, and apply these accordingly. 

(iii) we can do the comparison in a dust-extinction cor- 
rected colour-colour or colour-redshift plane, in which case 
no dust evolution should be applied to the colour tracks. 



There are advantages and disadvantages to all meth- 
ods. In the case of (i), we are over-simplifying the problem 
by ignoring a potential evolution with redshift, but we are 
keeping the information that relates to each individual cell. 
In the case of (ii) we are using a model for the evolution of 
dust, but we are assuming that the galaxies that we observe 
at low- and high-redshift are part of the same population. 
This latter assumption is exactly what we are looking to 
avoid by tapping into the fossil record. Option (iii) is prob- 
ably the most robust, but requires modelling of galaxies at 
high redshifts. In this paper we plot the colour-tracks over 
observed colours, and for simplicity we apply the correction 
as given by (i) above. For other applications, colour evolu- 
tion can be supplied with or without dust evolution. 

3.8 Summary of outputs 

It is worth summarising what the outputs of the analysis de- 
scribed in this section are. They can be broadly grouped into 
two types. On one hand we have measured quantities that re- 
late to the position of the cell at a given redshift and colour, 
e.g. dust information (tism, tbc), mass-weighted metallic- 
ities or mass-weighted ages. On another hand we have the 
colour-tracks as a function of redshift, which are inferred 
evolved quantities. All of these results have a dependence on 
the SSP synthesis codes. We describe the SSP models we 
use in the next section. 



4 MODELS 

VESPA models the spectrum of a galaxy as a superposition 
of simple stellar populations (SSPs) of different ages and 
metallicities. At the heart of this process lies the assumption 
that we know the spectral signature of different SSPs, and 
that we sample the full parameter space needed in order to 
model observed galaxies. 

There are three key steps that go into constructing an 
SSP. Firstly we need a description of a star's evolution given 
its mass and metallicity, in terms of observable parameters 
such as effective temperature or bolometric luminosity. This 
is traditionally given by the isochrones. Several groups have 
published sets of isochrones, which aim to model all stages of 
stellar evolution. Secondly, we must assume an initial mass 
function (IMF), which gives the correct weight as a function 
of mass, for stars formed in a single cloud of gas. Stars of 
different mass evolve with different time-scales, so by com- 
bining the isochrones and the IMF one can correctly popu- 
late a colour-magnitude diagram across the different stages 
of stellar evolution. The final step is to assign a spectrum 
to stars with different parameters, and this is done using 
the spectral libraries. Spectral libraries can be empirical or 
synthetic, with the former suffering from poor sampling of 
the parameter space of stellar evolution (dictated by the 
chemical enrichment of our own solar neighbourhood), and 
the latter suffering from deficiencies in our current ability to 
model stellar evolution. 

Crucially, the choices, treatments or calibrations in- 
volved in all of these three steps differ across different 
SPS codes, leading to unavoidable differences in the spec- 
trum of SSPs of fixed age and metallicity. These differ- 
ences are naturally centred around the least understood 
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stages of stellar evolution, such as the thermally-pulsating 
asymptotic giant branch (TP-AGB) phase, the supergiant 
phase, or horizontal branch stars, and the datasets used 
for internal calibrations. The first efforts to accurately es- 
timate the effects of the Horizontal Branch, Red Gi ant and 
TP-AGB b ranch on SSPs came f r om Ijimenez et all i 
Marastonl dl998l); Ijimenez et alj (|2004T ); IMaraston 



as it provides the best coverage in age and metallicity. The 
wavelengt h coverage was extende d to the UV following the 
method in IMaraston et al.l (|2009h . We adopt a Kroupa IMF 
|Kroupall200ll ). 



1998 



2005) 



Jimenez et all (jf 9981 , 12004^ " built an analytic model to de- 



scribe this stages of stellar evolution paying careful attention 
to use observations of individual and resolved stars to cal i- 
brate the stellar evolution models (see Ijimenez et aill 19951 ). 
They pointed out how overestimations of the TP-AGB and 
lack of morphological description of the HB compromised 
seriously conclusions drawn from SSPs. They were, also, 
the first ones to emphasize the importance of stellar inte- 
rior tracks being compu ted self-consistently with the stel- 
lar atmospheres models ijjimenez et al.i ri998. 20Q4|). Several 
authors have since followe d this approach to estimate the 
broad-band SSPs (see e.g. IConrov et al.ll2009l among oth- 
ers). 

IConrov et al.l (|2009h . for example, look at the impact 
of these choices and assumptions on the recovered physical 
properties of galaxies, using broadband colours, with the aim 
of estimating the systematic error that arises from the limi- 
tations of the models. Here we do not take such a systematic 
approach, but we do conduct our analysis using three sets of 
popular SPS codes and investigate the resulting differences. 
The choices of IMF, stellar libraries or method for comput- 
ing the stellar energetics cannot always be matched across 
different SPS codes. In each case we take the combination 
of these that best suits our application. We briefly describe 
the three sets of models next. 



4.1 BC03 



With the BC03 models ilBruzual fc Charlodl2003l ) we adopt 
a Chabrier initial mass function dChabrierll2003l) and Padova 
1994 evolutionary tracks (lAlongi et all 1993; Brcssa n'et al.l 
1 19931 ; iFagotto et all Il994al lbl; iGirardi et all 1 19961 ). We use 
the BC03 models w ith the empirical STELIB library 
l|Le Borgne et al.ll2003l ) in the optical (3 200A to 9500A) and 
the theoretical BaSeL 3.1 stellar library dLeieune et al.fl997l . 
ll998l ; IWestera et alj |2002i) to either side of that range. 



4.2 M10 

The M10 models (Maraston & Stromback , submitted) 
are the new, h igh-resolution version of the M05 models 
l|Marastonll2005l ). The energetics and stellar evolution calcu- 
lations are unchanged from the models in [Marastonl (Il998l ) 
and M05, so the post main-sequence con tinues to be treated 
with the fuel-consumption theore m dRenzini fc Buzzonil 
Il986l ; lBuzzonlll989l ; lMarastonlll998l ), leading to a more ro- 
bust treatment of advanced stages of stellar evolution, such 
as the TP-AGB phase. The M10 models and publication fo- 
cus not only on delivering a high-resolution version of the 
M05 models, but also on exploiting differences arising from 
stellar libraries and their calibration. Therefore, models are 
provided in a range of stellar libraries, each with differ- 
ent coverage in age, metall icity and wavelength. Here w e 
choose the MILES library ((Sanchez-Blazauez et al.| [2006). 



4.3 FSPS 

The Flexible Stellar Population Synthe sis (FSPS) code 
dConrov et al . 20091; IConrov" fc Gunnll2009l ) takes a novel ap- 
proach to the computation of SSP spectra. By parametriz- 
ing uncertain stages in stellar evolution and allowing these 
parameters to change freely, the authors attempt to both 
quantify our ignorance about stellar evolution in terms of 
derived galaxy properties, but also to calibrate their mod- 
els by finding the combination of parameters that best de- 
scribe certain observations. Furthermore, models are pro- 
vided with a choice of isochrones and stellar library. Here 
we ch ose the latest Padova evolutionary track s calcula- 
tions (|Marigo fc Girardill2007l ; iMarigo et al.ll2008l ), and the 
MILES stellar library. An UV extension was obtained by 
using the theoretical BaSeL spectral library. In spite of the 
freedom provided by these models, here we use them simply 
at their default settings and with the combination of param- 
eters that describe the TP -AGB phase tha t were found to 
best fit star cluster data bv lConrov fc Gunnl (|2009h . The op- 
portunity remains to explore further the flexibility of these 
models - we le ave that for a follow up paper. We use a 
IChabrierl l|2003h IMF. 



4.4 a - enhancement 

The metallicities implicit in the SPS models and quoted 
throughout this paper refer to [Fe/H] abundances. However, 
it is now observationally well established that ETGs and 
bulges have a larger abundance of the so-called a-elements 
(O, Ne, Mg, Si, S, Ca, Ti) than that predicte d by SPS mod- 



els, given the measured Fe abundances (e.g IWorthev et al 
| l992 j; lDavies et al.ll 19931; iFisher et al.lll995l;]Longhetti et~. 
hood ; IMaraston et alj " l2003l : IThomas et al.l [20031 . 120051 and 

references therein). 

a— elements are mostly produced through the collision 
of a-particle nuclei in type-II supernovae of very massive 
stars, whereas Fe-peak elements are associated with type-la 
supernova e, typically associated wi th older and lower mass 
stars fe.g. IWooslev fc Weaverlll995l ). Given the different de- 
lay times for the two types of stellar explosions, different 
[a/Fe] abundances have therefore been interpreted as a sig- 
nature for the l ength of time over which the star formation 
happ ened (e.g. iPagel fc Tautvaisienel 1 19951 : IThomas et alj 
120051 ). The mismatch with the Fe abundances predicted by 
SPS models therefore occurs because empirical stellar li- 
braries are primarily constructed by stars in the solar neigh- 
bourhood, which has a different chemical enrichment and 
star formation history than that which is typical of bulges 
and massive elliptical galaxies. 

Whereas some effort has gone into calibrating certain 
spectral indices for different a-element abundances (e.g 
IThomas et al.l [2003). this has not yet been done for the 
full spectrum. Different chemical abundances are therefore 
a known limitation of current SPS models based on em- 
pirical libraries, if one wants to use a full-spectral fitting 
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technique. Theoretical libraries however, can provide some 
insight on how differ ent chemical abund ances ratios affect 
galactic spectra (e.g. ICoelho et al.l 120071 ). For the moment, 
such discrepancies (surely to be systematic when looking at 
a population such as the LRGs) are automatically bundled 
with other unknown systematic errors in Section T3. 41 
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5 RESULTS 

The main product of this work is a set of colour evolutionary 
tracks as a function of redshift and colour. In conjunction 
with high redshift data to match to, these can be used to 
solve some galaxy evolution cases, and we will consider such 
an analysis in a companion paper. In this paper we now 
explore some of the most immediate results, such as depen- 
dence of dust, age and metallicity on colour and redshift. 

The dependence on the underlying SSP modelling is 
investigated in Section [6] In this section, for clarity, we will 
often show results by choosing one of the three sets of SPS 
sets of models, and we append the matching plots using the 
other two sets at the end of this paper. Our choice of SSP 
model in the main text in each case should not be interpreted 
as supporting one over the others. When practical, we will 
show results from all three sets of models alongside each 
other. 

We start by showing some a typical fit and residuals. 



5.1 Typical fit and residuals 

In Fig.[4]we show a fit for one of the stacks, and in Fig.[5]the 
corresponding residuals in units of the stack noise, as given 
by Eq. p]). This fit was obtained using the FSPS models 
(results for the same stack, with Mastro and BC03 mod- 
els can be found in Appendix A). In this case we can see a 
visually good match on the blue end, and an increasing ten- 
sion between data and models towards the red. The detail 
of the residuals plot reveals that there are significant depar- 
tures from the data at all wavelengths, but more notoriously 
blueward of 3800A and redwards of 5800A . 

Fig. [5] also shows regions of the spectrum that were 
excluded from fits that give the results presented in this 
section. Inclusion or exclusion of these regions does not affect 
the results in this section in any significant way. 



5.2 Measured quantities 

In this section we show how ages, dust and metallicity vary 
with colour, luminosity and redshift in our LRG samples. 
The luminosity ranges refer to those shown in Fig. [T] 

When we show or quote errors in this Section, they 
refer to variations in colour, for a given redshift bin and 
luminosity slice. These variations are typically larger than 
the systematic error obtained as per Section 13.41 which is 
likely to be an underestimation of the true systematic error. 
The effect of the statistical error on the recovered solutions 
is negligible. 

A summary of the results in this section can be found 
in Fig. ED 
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Figure 4. A typical fit, using the FSPS models (equivalent plots, 
for Mastro and BC03 models can be found in Appendix lAlJI . The 
black line is the data, and the red line the best-fitting model. The 
vertical yellow and green dashed lines guide the eye by showing 
some common emission and absorption features. Although plot- 
ted, some regions of the spectrum were excluded from the fit - see 
Fig. [5] 



5.2.1 The ages of LRGs 

The fiducial model is that LRGs formed mostly in one short 
epoch of star- formation at high-redshift. Therefore, the age 
of the oldest stars in LRGs of different redshifts should be 
consistent with a single epoch of formation. This is in prin- 
ciple testable and our revised age grid is sufficiently fine to 
test this hypothesis (see Section [3. 3p . 

In contrast, we find best-fit solutions given by the mod- 
els that are systematic ally older than the age of the Universe 
IjKomatsu et al.l 12009 ). Moreover, we found this behaviour 
with all three SPS models. We find that the best-fit solution 
is still too old in many cases, even after adjusting the er- 
ror in the flux according to the method in Section f3. 41 This 
suggests that the behaviour is systematic rather than sta- 
tistically random. We therefore decide to impose a strong 
prior on possible solutions, such that star-formation cannot 
happen in a bin whose youngest boundary has an age older 
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and we see the expected ageing of galaxies towards lower 
redshifts (largely dictated by our prior on the age of the 
Universe) . 

In Fig. [6] (3rd row) we collapse the histogram in colour, 
and show the evolution in redshift obtained with different 
SPS models. The error bars show the error on the mean over 
colour, and so are representative of the variation with colour, 
in a given redshift bin. In the 4th row of Fig. [S]we add the 
lookback time to the redshift of each stack, and show the 
formation time in the Earth-frame. 



5.2.2 Recent to intermediate star formation 

Collapsing the full SFH into a mass-weighted age, however, 
can mask interesting information such as the amount of re- 
cent star formation. In Fig.[5]and, collapsed in colour, in the 
5th panel of Fig. [6] we show the fraction of star-formation, 
by mass, recovered in bins up to 3.8 Gyr in the rest-frame 
of each stack. A summary is given in Table [JJ where we split 
this range further in intermediate and young ages. 

Although a fixed age in the rest frame is equivalent to 
a different fraction of the galaxies' ages as we go back in 
redshift, 3.8 Gyr serves as a good split between stars that 
formed in the oldest, fiducial burst, and anything that may 
have followed. We discuss these results in Section [S] 
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Figure 5. The residuals in units of the stack noise, for the fit 
showed in Fig. [4] The vertical dashed lines guide the eye by show- 
ing some common emission and absorption features. The regions 
shaded by horizontal lines were excluded from the fit. 



than the age of Universe. All results from hereon have this 
prior with the age of Universe set to be 13.8 Gyrs at z=0. 
The best-fitting solutions obtained with this prior have a for- 
mally worse x 2 than those without the prior, as expected. 
There is important information in the residuals of the solu- 
tions obtained with and without this prior - one is clearly 
wrong, but provides a better fit. The difference between the 
two sets of residuals should highlight which spectral features 
yield this difference. We leave this analysis to a companion 
paper. 

Fig. shows the dependence of the rest-frame mass- 
weighted age of each stack of galaxies, as a function of colour 
and redshift for four luminosity slices, and calculated us- 
ing the MIO models (identical plots obtained with BC03 
and FSPS are given in Appendix lA2|) . We calculate a mass- 
weighted age as 



{tags) — 



/* i tiXi 



(10) 



where we take ti as being the mean age of bin i. We 
see no strong evidence for a dependence of (t a ge) on colour, 



5.2.3 Metallicity 

Fig. [9] shows the mass-weighted metallicity as a function 
of r — i colour and redshift, for four luminosity slices, and 
calculated using the Mastro models (identical plots obtained 
with BC03 and FSPS are given in Appendix A). For each 
cell, we calculate the mass-weighted metallicity using the 
full star-formation history as 



(Z) 



(ii) 



To make any possible trend with redshift more clear, 
we average over colour and show the resulting redshift de- 
pendence in Fig. [6] (first row) . The horizontal dashed lines 
show the metallicities provided by the models - note that 
different sets of models sample the metallicity space in dif- 
ferent ways. We discuss the different behaviour of each set 
of models in Section [6] As in the previous section, the error 
bars are show the error on the mean over colour. 



5.2.4 Dust 

Fig. [10] shows the dust attenuation as a function of r — i 
colour and redshift, for four luminosity slices. To make any 
possible trend with redshift more clear, we average over 
colour and show the resulting redshift dependence in Fig. [6] 
(second row). The error bars are errors on the mean, and 
therefore give an indication of the scatter of dust with colour, 
for each redshift bin. We discuss these results in Section [6] 



5.3 Evolved quantities 

We can use the full star-formation history of each cell to esti- 
mate the colour evolution of a galaxy within that cell. This 
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Figure 6. The evolution of mass-weighted metallicity (first row), dust extinction (second row), mass-weighted age (third row), formation 
epoch (forth row) and recent star-formation fraction (fifth row) with redshift for stacks of different luminosities. The thickness of the 
line in all plots represents the luminosity of the galaxies in each stack (thinner for the faintest, and thickest for the brightest). At each 
redshift, the quantities are averaged over r — i colour. The horizontal dashed lines in the first row represent the metallicities provided by 
each set of models. Extrapolation beyond the extreme values is not permitted. The horizontal dashed lines in the 3rd row show the age 
boundaries used by VESPA. The error bars are the error on the mean over colour, and are representative of the variation with colour in 
a given redshift bin. See text for more details. 
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Figure 7. Mass-weighted age in Gyr (see Eg. 1101) for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed 
with the M10 models. The luminosity ranges change with redshift, and are shown in Fig. [T] We only show data for regions of parameter 
space with a sufficient number density of galaxies. In practice, there are a few galaxies (less than 20 per redshift bin) outside of the 
coloured areas, but these have an insignificant impact on the recovered solution. In Fig. [6] (3rd row) we average over colour to show the 
trend with redshift for each luminosity slice. 
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Figure 8. The logarithm of the fraction of star formation (by mass), recovered in bins up to 3.8 Gyr for galaxies of different colour, 
redshift and rest-frame r— band luminosity, analysed with the M10 models. The luminosity ranges change with redshift, and are shown 
in Fig. [T] We only show data for regions of parameter space with a sufficient number density of galaxies. In practice, there are a few 
galaxies (less than 20 per redshift bin) outside of the coloured areas, but these have an insignificant impact on the recovered solution. In 
Fig. \E\ (5th row) we average over colour to show the trend with redshift for each luminosity slice. 
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Figure 9. Mass-weighted metallicity (see Eq. 1111) for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed 
with the M10 models. The luminosity ranges change with redshift, and are shown in Fig. [T] We only show data for regions of parameter 
space with a sufficient number density of galaxies. In practice, there are a few galaxies (less than 20 per redshift bin) outside of the 
coloured areas, but these have an insignificant impact on the recovered solution. Fig. [6] averages over colour to show the trend with 
redshift for each luminosity slice. 
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Figure 10. Inter-stellar dust absorption for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed with the 
M10 models. The luminosity ranges change with redshift, and are shown in Fig. [T] We only show data for regions of parameter space 
with a sufficient number density of galaxies. In practice, there are a few galaxies (less than 20 per redshift bin) outside of the coloured 
areas, but these have an insignificant impact on the recovered solution. Fig. \E\ averages over colour to show the trend with redshift for 
each luminosity slice. 
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0.15 < 2 < 0.35 0.35 < z < 0.5 

SPS model Luminosity range Young Intermediate Young Intermediate 

SFF < 0.7 Gyr SFF 0.7 - 3.8 Gyr SFF < 0.7 Gyr SFF 0.7 - 3.8 Gyr 



BC03 


Li\J ^Drigntcstj 
Ll 
L2 

L3 (faintest) 


n nm Q_l_n nnnon 
U.UUloztU.UUUzy 

0.0019±0.00028 

0.0019±0.00033 

0.0013±0.00023 


U.U4 1 ztU.UUoo 
0.048±0.011 
0.054±0.011 
0.056±0.015 


U.UUDo±U.UUUZ4 
0.013±0.00028 
0.010±0.00024 
0.013±0.00029 


U.UDoztU.UUDo 
0.126±0.0057 
0.114±0.0023 
0.0776±0.0040 


M10 


LO (brightest) 
Ll 
L2 

L3 (faintest) 


0.0018±0. 00065 
0.0019±0.00069 
0.0015±0.00057 
0.00311±0.0022 


0.036±0.0033 
0.043±0.0071 
0.046±0.0051 
0.056±0.0098 


0.00014±4e-06 
0.00015±3e-06 
0.00044±l-06 
0.0009±2e-06 


0±0 
0.00280±7e-05 
0.0038±0.0001 
0.018±0.0004 


FSPS 


LO (brightest) 
Ll 
L2 

L3 (faintest) 


0.0068±0.0066 
0.0096±0.0094 
0.0022±0.0020 
0.013±0.0086 


0.0017±0.0007 
0.0020±0.0002 
0.0025±0.0007 
0.0076±le-05 


o o o o 
o o o o 


0.0046±0.0013 
0.0063±0.0015 
0.0048±0.00076 
0.026±0.00078 



Table 1. Summary of the recovered star-formation fractions at young and intermediate ages, averaged over two redshift ranges, for all 
luminosity slices and the three SPS models we consider. As in Fig. [6] the errors come from the variation with colour at a given redshift 
bin. 



information can be coupled with the apparent magnitude 
and exact redshift of the galaxy to calculate its luminos- 
ity evolution. In this section we present a selection of these 
colour tracks for a sample of galaxies selected in colour, lu- 
minosity and redshift. As before, we present the results using 
only one set of SPS models in this section (FSPS), and pro- 
vide the corresponding plots using the other two SPS models 
in Appendix [A3] 

5.3.1 Spectro and model magnitudes 

The fibre aperture in the SDSS has a fixed size of 3 arc- 
seconds. Apparent sizes of objects are often larger than this, 
leading to an unavoidable discrepancy between the magni- 
tudes obtained from integrating a spectrum over a filter's 
response (the spectro magnitudes), and the best estimate 
of the photometric magnitudes (the cmodel magnitudes). 
We do our stacking using the latter, but we use the spec- 
trum to obtain our star formation and metallicity histories, 
which in turn refer only to the 3 arc-second fibre. There- 
fore we need to be able to map one quantity to the other, 
and understand any possible biases in this relationship. The 
spectrophotometric calibration can also affect this mapping, 
and there is a known offset between the spectro and fibre 
magnitudes (obtained from the photometry, within the 3 
arc-seco nd aperture) since DR6 due to h ow this calibration 
is done (|Adelman-McCarthv et alj 120081 ). This offset, how- 
ever, has an insignificant dependence on photometric band 
and it is not a problem for our colour selection. 
Fig. [11] shows the distribution of 

A[r - i] = [r - i] spcc tro - [r - i] C modei, (12) 

for a random sample of 10,000 LRGs. This scatter can be 
better understood by explicitly writing 

[r - i]cmodci = [r - i] 3 » + <5 P hoto + [r — i] e xtra (13) 

[V — i]spcctro = [r — + ^spoctro, (14) 

where [r — i] extra is the difference in colour due to a radial 
colour gradient extending outwith the 3 arc-second fibre, and 



0.30 




A r-i 

Figure 11. The distribution of Ar — i (Eq. I12H for a random 
sample of 10,000 LRGs. 



5s P cctro and 5 p hoto are noise in the spectroscopic and photo- 
metric measurements, respectively. Assuming that <5 sp ectro 
and <5 p hoto are stochastic components, an offset from zero 
in A[r — i] would be indicative of a systematic behaviour 
in [r — i] extra- Fig. [11] shows no evidence for such an offset. 
Fig. [12] shows A[r — i] as a function of redshift - there is 
an indication of a positive slope at high-redshift. A positive 
A[r — i] means a larger spectro colour, which in turn means 
a bluer colour inside the fibre, when compared to the full 
aperture. This is counter-intuitive for two reasons - firstly 
one would expect the proportion of light falling into the 3 
arc-second fibre to go up with redshift, as apparent sizes get 
smaller; and secondly, one would expect the outer regions to 
be bluer in colour in comparison to the central regions. 

Note, however, that at high redshift we are intrinsically 
selecting increasingly redder galaxies (in cmodel colours) 
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Figure 12. The mean and scatter of Ar — i as a function of 
redshift. See text for an explanation of the upturn at high redshift. 



due to the LRG sample selection. These, in turn, are likely 
to have the largest positive values of <5 p hoto and, in conse- 
quence, of A[r — i]. There is an intrinsic Malquist-type se- 
lection bias in A[r — i] simply due to the fact that we select 
in [r — jjcmodci- The selection of galaxies for each cell will 
be biased in a similar way, but this bias is known as we can 
compute A[r — i] for each cell. 

Without independent photometry, it is hard to disen- 
tangle the redshift-dependence of this selection bias from a 
true dependence of [r — i] C xtra in colour, or redshift. How- 
ever, Figs. llll and ll2l show no evidence of a detectable signal 
in [r — i] extra- Therefore, to translate from spectro colour to 
cmodel colour we simply apply a correction to the begin- 
ning of each track. We do this in all of the plots in the next 
section. 



5.3.2 Colour tracks 

Figs. [13] and [14] shows the predicted evolution of r — i and 
g — r colour with redshift, for four combinations of colour 
and luminosity and obtained with the FSPS models. Fig. 1151 
shows this evolution in the r — i vs g — r plane, using the 
same models. In each case we over-plot with contour lines 
the number of observed LRGs for in the same luminosity 
range. The contours are wider for the fainter sample, but we 
cannot immediately tell whether this is due to an increased 
photometric error for fainter objects, or due to any intrin- 
sic colour-luminosity relation. We explore these results and 
their model dependence in Section [6] 



6 MODEL DEPENDENCE AND 
INTERPRETATION 

The results presented in the previous section cannot be in- 
terpreted without addressing the fact that each set of models 
often gives a different result. In this paper we do not directly 
tackle the question of which set of models is the most cor- 
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Figure 13. Predicted evolution of r — i colour with redshift, cal- 
culated according to Eq. {S}, and the solutions presented in Sec- 
tion !5.2l The different panels show examples at different combina- 
tions of colour and luminosity. The black contour lines show the 
number density of LRGs within the respective luminosity slice, 
and the coloured lines show the tracks from cells at four different 
redshifts: 0.175, 0.275, 0.375 and 0.475. The star shows the posi- 
tion of the cell the tr ack relates to. The gre en dashed line shows 
the LRG model from iMaraston et al.l {20090, for comparison. 



rect, we limit ourselves to a very basic exploration of the 
residuals in Section [6.31 

Overall, we find agreement with the fiducial model that 
says that LRGs are dominated by old, metal rich stars. 
However, we find that some small departure of this model 
is needed to fit the stacked spectra, and that small order 
changes to the fiducial model depend on the SPS model 
used. Ultimately, we are interested in seeing how the stel- 
lar evolution of LRGs translates into colour tracks, and the 
potential implication for galaxy evolution and observational 
cosmology. We begin by analysing the in-situ results of Sec- 
tion [5]2] and then we discuss the effects on the colour tracks 
of Section 15.31 
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Figure 14. Predicted evolution of g — r colour with redshift, cal- 
culated according to Eq. and the solutions presented in Sec- 
tion !5.2l The different panels show examples at different combina- 
tions of colour and luminosity. The black contour lines show the 
number density of LRGs within the respective luminosity slice, 
and the coloured lines show the tracks from cells at four different 
redshifts: 0.175, 0.275, 0.375 and 0.475. The star shows the posi- 
tion of the cell the tr ack relates to. The gre en dashed line shows 
the LRG model from Maraston et alj j2003h . for comparison. 



Figure 15. Predicted evolution of r — i colour with g — r, cal- 
culated according to Eq. (J9J, and the solutions presented in Sec- 
tion !5.2l The different panels show examples at different combina- 
tions of colour and luminosity. The black contour lines show the 
number density of LRGs within the respective luminosity slice, 
and the coloured lines show the tracks from cells at four different 
redshifts: 0.175, 0.275, 0.375 and 0.475. The star shows the posi- 
tion of the cell the tr ack relates to. The gre en dashed line shows 
the LRG model from iMaraston et alj |200Sh . for comparison. 



6.1 Measured quantities 

Fig. |6] summarises the in-situ measurements presented in 
Section E2 

We see a different behaviour in the metallicity evolu- 
tion when using different models, but all models consis- 
tently show LRGs to be metal-rich, as expected following 
the fiducial model. This difference is not a surprise - metal- 
licity is known to be more model depe ndent than age (e.g. 
iPanter et all 120081 ; iToieiro et aLI 120091 "). and even the sim- 
ple fact that the models sample the metallicities at different 
intervals will have some effect. 

The sharp increase in metallicity seen when using BC03 
models is most likely due to the age-metallicity degeneracy. 



BC03 predicts a modest amount of recent star-formation 
at low-redshift, but this fraction increases to higher red- 
shifts (5th panel). The results is a mass- weighted forma- 
tion time that gets younger with redshift, and an increas- 
ing metallicity. It is worth noting that the fraction of mass 
in young/intermediate stars is not generally uniformly dis- 
tributed over the last 3 Gyr of the galaxies. Instead, VESPA 
generally assigns the mass to one or two age bins, and the 
particular bins with the most mass change from stack to 
stack. In the case of BC03 and M10, these are firmly at the 
intermediate ages regime. This is probably indicative of spo- 
radic star formation episodes rather than a continuous star 
formation type of history, for any given stack. 

The FSPS and M 10 models paint a scenario much closer 
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to the fiducial model of passive stellar evolution, and a con- 
stant high-metallicity. However, the M10 models require a 
small amount of intermediate star formation at low red- 
shift (a little over 10% by mass for the faintest objects, and 
slightly less for brightest ones), which in turn drives down 
the mass-weighted ages of the faintest objects at low red- 
shift. Note that this is different from the reason that drives 
down the mass- weighted ages of the faintest objects in FSPS 
- in this case it is the dominant, oldest populations that 
are slightly younger. This type of distinction is possible due 
to having time-resolved star formation histories, and it be- 
comes crucially important if one aims to use the age-redshift 
relationship of LRGs to put tight constraints on H(z) (e.g . 
Jimen ez ct al. 2003; Simon et al. 2005; Crawford et al.ll2010l ; 
Stern et aLlbOloD . 

Aside from the dust results, we see little coherent depen- 
dence on luminosity, in either age, metallicity or amount of 
recent star formation. Work on ETGs has consistently shown 
that fainter objects tend to have younger stars on average 
(e.g. iThomas et al.ll2005l ; ICarson fc Nicho]||2010l ; IZhu et all 
l20ld ), but the reason why we do not see clearly in this 
study is easily understood. The range in luminosity of our 
LRG sample is much smaller than the traditional ETG sam- 
ples, which tend to go down to fainter magnitudes, leaving 
us with a small lever-arm in luminosity with which to con- 
strain this type of relationship. Furthermore, as the LRG 
sample is neither magnitude nor volume limited, any trend 
with luminosity over redshift is difficult to interpret. We do 
see a tendency for fainter objects to be younger, in both 
the FSPS and M10 cases (although for different reasons, as 
explained in the previous paragraph), but this is limited to 
2 < 0.3. This is explained both by the lack of faint galaxies 
in the sample at larger redshifts, and by the larger range in 
magnitudes in the sample at low redshifts. 

Dust extinction results are relatively consistent across 
different models. This is likely to be due to the fact that the 
dust modelling is common in all three cases, and that the 
large-scale changes due to the presence of dust are indepen- 
dent enough of large-scale changes due to age or metallicity. 
We consistently see a very clear separation in luminosity - 
fainter objects have more dust extinction than bright ob- 
jects - and an increase towards high-redshift. In the case of 
Mastro and FSPS, we also see evidence for an upturn at low 
redshifts, suggesting that perhaps there is some contamina- 
tion at his end by dustier galaxies (with some recent star 
formation in both cases, although up to 10 times more, in 
fractional mass, in the case of the Mastro models). Addi- 
tionally, dust extinction is the only of our measurements to 
show a clear trend with colour - redder cells tend to have 
larger extinction, across all luminosities. 

6.2 Evolved quantities 

Our particular interest in this paper is to see how the so- 
lutions discussed in the previous section translate into the 
colour evolution of LRGs of different colour, luminosity and 
redshift. Inevitably, the model dependence observed in the 
previous section will manifest itself in the colour tracks. 

The most obvious is the effect of recent star formation. 
An episode of star formation results in both a short and 
abrupt change in the colour tracks, and a reddening of the 
colours to higher redshift. The magnitude of these two ef- 



fects depends on the size (in mass), age and duration of the 
episode of star formation. Metallicity and dust induce less 
abrupt changes in the colour evolution. 

The interpretation of the results from the different sets 
of models is rather different. In the case of FSPS, with 
negligible amounts of recent star formation and a constant 
metal-rich population, the colour tracks follow the locus of 
observed objects very closely. As the selection box was de- 
signed around a passively evolving model, this is no less the 
expected behaviour. Furthermore, we see little dependence 
on colour or luminosity, suggesting that the differences noted 
in the previous section (namely dust and age of oldest pop- 
ulation) have a small effect on the colour evolution. 

It is a different matter in the case of the M10 or BC03 
models. With BC03, the substantial and ever-present (in 
redshift, colour and luminosity) level of recent star formation 
means that star formation events constantly push the colour 
tracks outside the contours. Depending on the duration of 
the burst (and we have limited resolution with our grid) then 
an object remains outside the selection for box for a given 
amount of time, until the excessively blue light of the young 
stars subsides enough to make the object red once again. In 
principle, there is nothing wrong with this scenario, and it 
follows that if LRGs go through small and sporadic events 
of star formation, then their true number density is greater 
than the number density measured inside the selection box 
at any given redshift. The fraction of objects not observed 
due to this is also in principle calculable, with an analysis 
of this type and good, trustworthy modelling. 

In the case of the M10 models, this effect is much less 
important due to the smaller and more restricted amount of 
recent star formation. M10 models tend to be redder (espe- 
cially in r — i), but mostly lie within the locus of observed 
galaxies, even when a small amounts of star formations is 
detected. 

Changes in colour tracks can not be attributed solely to 
the differences in the physical solutions. These were obtained 
by fitting the stacked spectra in the optical (A > 3200A, de- 
pending on the models), and we compute the colours to a 
redshift range that surpasses the optical band, so UV exten- 
sions were used to accomplish this. Differences in the spec- 
tral libraries, and in the treatment of blue stragglers or blue 
horizontal branch stars affect the inferred colour evolution 
in this regime, even when if they have limited influence in 
the optical. 

6.3 Model comparison 

A natural question to ask is: which set of models best de- 
scribes real galaxy spectra? Answering this question, how- 
ever, is far from trivial. We can use the distribution of the 
average residual per pixel (as calculated in Section [3.41 and 
shown for one stack in Fig. [3} , as the simplest possible mea- 
sure of goodness of fit. We show the distribution of these 
values for our 124 stacks and for all three SPS models in 
Fig. 1161 There is a factor of roughly 1.5 between the mean 
residual of FSPS and BC03 or M10 models. This suggests 
FSPS models provide a better match overall for fitting the 
high SNR data. The physical solutions from this set of mod- 
els are also the most in line with the LRG fiducial model, al- 
though the M10 solutions also broadly fit in with the mostly 
passive, metal-rich typical LRG picture. However the inter- 
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Figure 16. A simple measure of goodness of fit, taken as the 
average residual per pixel. In the black solid line we show the 
histogram for the results obtained with the BC03 models, in the 
red dashed line the results for the Mastro models, and in the blue 
dotted-dashcd line the results for the FSPS models. 



pretation of this result is not straightforward and there are 
a number of caveats. 

We are in a regime where neither set provides a statis- 
tically good fit to the data, and we do not know what the 
true answer should be. This means that when interpreting 
the best-fit solution and its residual when compared against 
the data, we need to bear in mind that the fitting procedure 
can compensate for deficiencies in the models by choosing 
a wrong solution. This behaviour can be explicitly seen in 
the ages of LRGs, where the best fit solution in data-space 
is known to be incorrect (see Section r5.2.1l for more details). 
In that case, we have strong physical reason to impose a 
prior on the age of LRGs based on the age of the Universe 
obtained from independent methods. An approach based on 
strong physical priors, or on requiring different wavelength 
ranges to give consistent answers, can help to identify spe- 
cific regions where each model is more likely to fail. These 
regions can in turn be interpreted in terms of stellar physics 
and library deficiencies, and be a useful tool for SPS models. 
Having a wider range of galaxy populations will also provide 
a broad test of the models. We leave this analysis for another 
publication. 



7 SUMMARY AND CONCLUSIONS 

We have computed, for the first time, colour-evolution mod- 
els for the SDSS LRG sample, that depend on luminos- 
ity, colour and redshift, and are completely decoupled from 
the selection function of the survey. This is possible due to 
a full-spectral fitting technique (VESPA), which computes 
highly-resolved non-parametric star formation and metallic- 
ity histories, as well as the dust extinction. For 124 high 
signal-to-noise stacks of spectra for galaxies with differ- 
ent colour, luminosity and redshift, VESPA fits show how 
mass-weighted ages, metallicities, recent star formation and 
dust content vary. These solutions were then translated into 
colour-evolution tracks. For this analysis have considered 



three different SPS codes: BC03, M10 and FSPS, and stud- 
ied the effect of this choice on the results. 

Our results and conclusions can be summarised as fol- 
lows: 

(i) When faced with high-quality data, the models are not 
able to produce formally good fits to the data. We use an 
updated error estimate, based on the fit residuals, as a way 
to deal with these limitations. 

(ii) The broad picture that LRGs are dominated by old, 
metal-rich stellar populations is confirmed by our analysis 
using all SPS codes. However, whereas FSPS predicts virtu- 
ally no recent star formation, M10 and BC03 require some 
amount of young stars to fit the spectra of the LRGs. 

(iii) In the case of M10 and FSPS, we see some evidence 
for dependence of mass- weighted age on luminosity, with the 
faintest sample in both cases having a lower mass-weighted 
age. However, whereas in the M10 models this is due to star 
formation at young/intermediate ages, in the case of the 
FSPS models this is due to the dominant, old population 
being younger. Distinguishing between these two scenarios 
is crucial for studies that aim to use the age-redshift rela- 
tionship of LRGs for cosmological constraints. 

(iv) All three models reveal a similar picture for the dust 
extinction: extinction increases with decreasing luminosity, 
increasing redshift, and increasing r — i colour. 

(v) Dust extinction is the only quantity we studied which 
has a significant and coherent gradient with r — i colour. 

(vi) The effect of recent star formation - even in small 
amounts - in the colour tracks can be quite dramatic, and 
temporarily remove LRGs from the SDSS LRG selection. 
It follows that, if LRGs really do go through such events, 
the number density of objects within the selection box is 
smaller than the true number density of LRGs. The fraction 
of LRGs going through this phase, as a function of redshift, 
can in principle be estimated using our models. 

(vii) A test of which set of models best fits the data is 
non-trivial as no set gives a statistically good fit to the data. 
FSPS solutions produce, on average, lower residuals than the 
other two sets of models, but the significance of this cannot 
currently be established. 

(viii) The differences in the colour tracks predicted by dif- 
ferent SPS codes are the limiting factor in using these models 
to link populations of galaxies at different redshifts, and sug- 
gest that previous analyses that require this link (any based 
on the evolution of a quantity with redshift, such as num- 
ber or luminosity densities, clustering, etc) are likely to at 
best have underestimated their errors, and at worse suffered 
from a systematic error due to our lack of understanding in 
stellar evolution. 

Lastly, we note that the results in this paper apply only 
to the LRG population, which is a subset of the ETG popu- 
lation. The relationship is more pronounced in terms of mass 
than colour - the LRG population occupies the most massive 
end of the ETG population, which is in itself predominantly 
red. 

The colour evolution tracks obtained with 
any of the SPS models can be found in 
: //www. i eg. port . ac .uk/~tojeiror/lrg_evolution/ . 
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Figure Al. A typical fit, using the M10 models. The black line 
is the data, and the red line the best-fitting model. The vertical 
yellow and green dashed lines guide the eye by showing some com- 
mon emission and absorption features. Although plotted, some 
regions of the spectrum were excluded from the fit - see Fig. IA2I 



APPENDIX A: ADDITIONAL PLOTS WITH 
DIFFERENT SPS MODELS 

In this Appendix we show the equivalent of the plots shown 
in the main text, for all other SPS models considered. All 
plots are like for like, and we refer the reader to the main 
text for an explanation. 



Al Fits and residuals 

In this section we show the equivalent of Fig.|4]and Fig.[5]for 
the same stacked spectra, analysed with the M10 models (in 
Fig. DPI and Fig. ED and with the BC03 models (in Fig. [Ml 
and Fig. EH). 
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Figure A2. The residuals in units of the stack noise, for the 
fit showed in Fig. IA1I The vertical dashed lines guide the eye 
by showing some common emission and absorption features. The 
regions shaded by horizontal lines were excluded from the fit. 



Figure A3. A typical fit, using the BC03 models. The black line 
is the data, and the red line the best-fitting model. The vertical 
yellow and green dashed lines guide the eye by showing some com- 
mon emission and absorption features. Although plotted, some 
regions of the spectrum were excluded from the fit - see Fig. IA4I 



A2 2D histograms 

In this section we present the 2D histograms of Section 15.21 
which show the dependence of age, metallicity, dust extinc- 
tion and amount of recent star formation (< 3 Gyr) as a 
function of colour and redshift. Figs. IA5I to IA8I show the 
results obtained with using the FSPS code, and Figs. IA9I 
to lA12l the results obtained using the BC03 code. 



A3 Colour tracks 

Here we present the colour tracks, as in shown Section r5.3.2l 
computed using the M10 models (Figs. [Alljl to lAlo} and the 
BC03 models (Figs. [ATol to lATS)) . 
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Figure A4. The residuals in units of the stack noise, for the 
fit showed in Fig. IA3I The vertical dashed lines guide the eye 
by showing some common emission and absorption features. The 
regions shaded by horizontal lines were excluded from the fit. 
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Figure A5. Mass-weighted age in Gyr (see Eq. 1 1 P for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed 
with the FSPS models. We only show data for regions of parameter space with a sufficient number density of galaxies. In practice, there 
are a few galaxies (less than 20 per redshift bin) outside of the coloured areas, but these have an insignificant impact on the recovered 
solution. In Fig. [6] (3rd row) we average over colour to show the trend with redshift for each luminosity slice. 
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Figure A6. The logarithm of the fraction of star formation (by mass), recovered in bins up to 3.8 Gyr for galaxies of different colour, 
redshift and rest-frame r— band luminosity, analysed with the FSPS models. We only show data for regions of parameter space with a 
sufficient number density of galaxies. In practice, there are a few galaxies (less than 20 per redshift bin) outside of the coloured areas, but 
these have an insignificant impact on the recovered solution. In Fig. [6] (5th row) we average over colour to show the trend with redshift 
for each luminosity slice. 
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Figure A7. Mass-weighted metallicity (see Eq. lilt for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed 
with the FSPS models. We only show data for regions of parameter space with a sufficient number density of galaxies. In practice, there 
are a few galaxies (less than 20 per redshift bin) outside of the coloured areas, but these have an insignificant impact on the recovered 
solution. Fig. [6] averages over colour to show the trend with redshift for each luminosity slice. 
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Figure A8. Inter-stellar dust absorption for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed with the 
FSPS models. We only show data for regions of parameter space with a sufficient number density of galaxies. In practice, there are a few 
galaxies (less than 20 per redshift bin) outside of the coloured areas, but these have an insignificant impact on the recovered solution. 
Fig. [6] averages over colour to show the trend with redshift for each luminosity slice. 
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Figure A9. Mass-weighted age in Gyr (see Eq. 1 1 P for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed 
with the BC03 models. We only show data for regions of parameter space with a sufficient number density of galaxies. In practice, there 
are a few galaxies (less than 20 per redshift bin) outside of the coloured areas, but these have an insignificant impact on the recovered 
solution. In Fig. [6] (3rd row) we average over colour to show the trend with redshift for each luminosity slice. 
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Figure A10. The logarithm of the fraction of star formation (by mass), recovered in bins up to 3.8 Gyr for galaxies of different colour, 
redshift and rest-frame r— band luminosity, analysed with the BC03 models. We only show data for regions of parameter space with a 
sufficient number density of galaxies. In practice, there are a few galaxies (less than 20 per redshift bin) outside of the coloured areas, but 
these have an insignificant impact on the recovered solution. In Fig. [6] (5th row) we average over colour to show the trend with redshift 
for each luminosity slice. 
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Figure All. Mass-weighted metallicity (see Eg. lilt for galaxies of different colour, redshift and rest-frame r— band luminosity, analysed 
with the BC03. We only show data for regions of parameter space with a sufficient number density of galaxies. In practice, there are a 
few galaxies (less than 20 per redshift bin) outside of the coloured areas, but these have an insignificant impact on the recovered solution. 
Fig. [6] averages over colour to show the trend with redshift for each luminosity slice. 
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Figure A12. Inter-stellar dust absorption for galaxies of different colour, redshift and rest-frame i — band luminosity, analysed with the 
BC03 models. We only show data for regions of parameter space with a sufficient number density of galaxies. In practice, there are a few 
galaxies (less than 20 per redshift bin) outside of the coloured areas, but these have an insignificant impact on the recovered solution. 
Fig. [6] averages over colour to show the trend with redshift for each luminosity slice. 
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Figure A13. Predicted evolution of r — i colour with redshift, 
calculated according to Eq. (0, and the solutions presented in 
Section [52] and using the M10. The different panels show exam- 
ples at different combinations of colour and luminosity. The black 
contour lines show the number density of LRGs within the respec- 
tive luminosity slice, and the coloured lines show the tracks from 
cells at four different redshifts: 0.175, 0.275, 0.375 and 0.475. The 
star shows the position of the cell the tr ack relates to. The gree n 
dashed line shows the LRG model from [Maraston et al.l l|2009f l. 
for comparison. 



Figure A14. Predicted evolution of g — r colour with redshift, 
calculated according to Eq. J9J, and the solutions presented in 
Scction l5.2l and using the M10. The different panels show exam- 
ples at different combinations of colour and luminosity. The black 
contour lines show the number density of LRGs within the respec- 
tive luminosity slice, and the coloured lines show the tracks from 
cells at four different redshifts: 0.175, 0.275, 0.375 and 0.475. The 
star shows the position of the cell the tr ack relates to. The gree n 
dashed line shows the LRG model from [Maraston et al.l l|2009f ). 
for comparison. 
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Figure A15. Predicted evolution of r — i colour with g — r, 
calculated according to Eq. J5J), and the solutions presented in 
Section l5.2l and using the M10 models. The different panels show 
examples at different combinations of colour and luminosity. The 
black contour lines show the number density of LRGs within the 
respective luminosity slice, and the coloured lines show the tracks 
from cells at four different redshifts: 0.175, 0.275, 0.375 and 0.475. 
The star shows the position of the cell the trac k relates to. The 
green dashed line shows the LRG model from Maraston ct al. 
(2009), for comparison. 



Figure A16. Predicted evolution of r — i colour with redshift, 
calculated according to Eq. and the solutions presented in 
Section 15.21 and using the BC03 models. The different panels 
show examples at different combinations of colour and luminosity. 
The black contour lines show the number density of LRGs within 
the respective luminosity slice, and the coloured lines show the 
tracks from cells at four different redshifts: 0.175, 0.275, 0.375 and 
0.475. The star shows the position of the cell the track relates to. 
The g reen dashed line shows the LRG model from lMaraston et al] 
(120091) , for comparison. 
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Figure A17. Predicted evolution of g — r colour with redshift, 
calculated according to Eq. (|9]|, and the solutions presented in 
Section I5.2I and using the BC03 models. The different panels 
show examples at different combinations of colour and luminosity. 
The black contour lines show the number density of LRGs within 
the respective luminosity slice, and the coloured lines show the 
tracks from cells at four different rcdshifts: 0.175, 0.275, 0.375 and 
0.475. The star shows the position of the cell the track relates to. 
The g reen dashed line shows the LRG model from lMaraston et al.l 
||2009| ), for comparison. 



Figure A18. Predicted evolution of r — i colour with g — r, 
calculated according to Eq. JS), and the solutions presented in 
Section 15.21 and using the BC03 models. The different panels 
show examples at different combinations of colour and luminosity. 
The black contour lines show the number density of LRGs within 
the respective luminosity slice, and the coloured lines show the 
tracks from cells at four different redshifts: 0.175, 0.275, 0.375 and 
0.475. The star shows the position of the cell the track relates to. 
The g reen dashed line shows the LRG model from lMaraston et al.l 
j2009l) , for comparison. 
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